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Abstract 

The notion of a fc-automatic set of integers is well-studied. We develop a new 
notion — the fc-autoniatic set of rational numbers — and prove basic properties 
of these sets, including closure properties and decidability. 



1. Introduction 

Let k be an integer > 2, and let N = {0, 1,2, . . .} denote the set of non- 
negative integers. Let S/; = {0, 1, . . . , — 1} be an alphabet of k letters. Given 
a word w = aia2 - - -at G S^, we let [w\k denote the integer that it represents in 
base fc; namely, 



where (as usual) an empty sum is equal to 0. For example, [101011]2 = 43. 

Note that in this framework, every element of N has infinitely many distinct 
representations as words, each having a certain number of leM,ding zeroes. Among 
all such representations, the one with no leading zeroes is called the canonieal 
representation; it is an element of Ck '■= {e} U (Sfe — {0})!]^. For an integer 
n > 0, we let {n)k denote its canonical representation. Note that e is the 
canonical representation of 0. 
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(1) 



l<i<t 



Given a language i C S^, we can define the set of integers it represents, as 
follows: 

[L]k = { [w]k -.w&L}. (2) 

Definition 1. We say that a set S" C N is k-automatic if there exists a regular 
language L C such that S = [L\k- 

The class of fc-automatic sets of natural numbers has been widely studied 
(e.g., 0, y,Q), and many properties of these sets are known. For example, it 
is possible to state an equivalent definition involving only canonical representa- 
tions: 

Definition 2. A set 5 C N is k-automatic if the language 

{S)k ■■= {{n)k -.neS} 

is regular. 

To see the equivalence of Definitions [T] and [H note that if i is a regular 
language, then so is the language L' obtained by removing all leading zeroes 
from each word in L. 

A slightly more general concept is that of fc-automatic sequence. Let A be 
a finite alphabet. Then a sequence (or infinite word) (a„)„>o over A is said to 
be fc-automatic if, for every c 6 A, the set of fibers Fc ~ {n N : an — c} is 
a fc-automatic set of natural numbers. Again, this class of seq uences has been 
widely studied 0, 1^, Q . The following result is well-known 



IC 



Theorem 3. The sequence (a„)„>o is k-automatic iff its k-kernel, the set of its 
subsequences K ~ {(afe<!n+/)n>o '■ e > 0, < / < fc*^}, is of finite cardinality. 

In a previous paper [21] , the second author extended the notion of fc-automatic 
sets over N to sets defined over the set of non-negative rational numbers. 
In this paper, we will obtain some basic results about this new class. Our prin- 
cipal results are Theorem [T7] (characterizing the fc-automatic sets of rationals 
consisting entirely of integers). Corollary [531 (showing that fc-automatic sets of 
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rationals are not closed under intersection) , Theorem [52] (showing that it is de- 
cidable if a /c-automatic set of rationals is infinite) , and Theorem [33] (showing 
that it is decidable if a fc-automatic set of rationals equals N). 



2. Previous work 

The class of sets we study has some similarity to another class studied by 



Even 
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and Hartmanis and Stearns [l^; their class corresponds to the topo- 
logical closure of a subclass of our fc-automatic sets in which the denominators 
are restricted to powers of k. 

Yet another model of automata accepting real numbers was studied in 
. In this model real numbers are represented by their (possibly infinite) base- 
k expansions, and the model of automaton used is a nondeterministic Biichi 
automaton. However, even when restricted to rational numbers, their model 
does not define the same class of sets, as we will show below in Corollary [55] 



3. Representing rational numbers 

A natural representation for the non- negative rational number p/q is the pair 
{p,q) with q ^ 0. Of course, this representation has the drawback that every 
element of Q-*^ has infinitely many representations, each of the form {jp/d, jq/d) 
for some j > 1, where d — gcd(p, q). 

We might try to ensure uniqueness of representations by considering only 
"reduced" representations (those in "lowest terms"), which amounts to repre- 
senting p/q by the pair {p/d, q/d) where d = gcd(p, q). In other words, the only 
valid pairs are (p, g) with gcd(p, q) = 1. However, the condition gcd(p, g) = 1 
cannot be checked, in general, by finite or even pushdown automata — see Re- 
mark [55] below — and deciding if an arbitrary infinite regular language consists 
entirely of reduced representations is not evidently computable (see Section [5]). 
Furthermore, insisting on reduced representations rules out the representation 
of some "reasonable" sets of rationals, such as {(fc™ — l)/(fc" — 1) : m,n> 1} 
(see Theorem [22]) . For these reasons, we allow the rational number p/q 
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to be represented by any pair of nonnegative integers {p',q') with 
p/q = p' /q' , and we allow a rational number to be represented any nonzero 
number of times. 

Next, we need to see how to represent a pair of integers as a word over a 
finite alphabet. Here, we follow the ideas of Salon [l^, 13 13 ■ Consider the 
alphabet S^. A finite word w over can be considered as a sequence of pairs 
w = [fli, bi] [a2, 62] • • • [o-m bn] where ai,bi G Efe for 1 < i < n. We can now define 
the projection maps tti, 7r2, as follows: 

7ri(w) = oifla • • • a„; 7r2(t«) = 6162 ■■■K- 

Note that the domain and range of each projection are (S^)* and S^, re- 
spectively. Then we define [w]k — ([7ri(u))]fe, [7r2(w)]fc). Thus, for example, 
if u; = [1,0][0,1][1,0][0,0][1,1][1,0], then [w]2 = (43,18). We also define x, 
which allows us to join two words WjX € S^. of the same length to create a 
single word w x x € (S^)* whose tti projection is w and 7r2 projection is x. 

In this framework, every pair of integers {p, q) again has infinitely many dis- 
tinct representations, arising from padding on the left by leading pairs of zeroes, 
that is, by [0,0]. Among all such representations, the canonical representation 
is the one having no leading pairs of zeroes. We write it as (p, q)k- 

We now state the fundamental definitions of this paper: 

Definition 4. Given a word w G (S^)* with [TT2{w)]k 0, we define 

, X [7ri(w)]fc 
quOfe(w) := , , . 

If [7r2(u')]A; — 0, then quoj.(u') is not defined. Further, if [7r2(u))]fc for 
all w ^ L, then qu0j.(i) :— {qu0j,(w) : w G L}. A set of rational numbers 
S C is k-automatic if there exists a regular language L C (S^)* such that 
S = quofe(i). 

We reiterate that, in our definition, a given rational number a can have 
multiple representations in two different ways: it can have non-canonical repre- 
sentations that begin with leading zeroes, and it can have "unreduced" repre- 
sentations {p,q) where gcd(p, q) > 1. 
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Given a set S C if S contains a non- integer, then by calling it k- 

automatic, it is clear that we intend this to mean automatic in the sense of this 
section. But what if S* C N? Then calling it "automatic" might mean automatic 
in the usual sense, as in Section [TJ or in the extended sense introduced in this 
section, treating S* as a subset of In Theorem [TTl we will see that these two 
interpretations actually coincide for subsets of N, but in order to prove this, we 
need some notation to distinguish between the two types of representations. So, 
from now on by (N, fc)-automatic we mean the interpretation in Section [T] and 
by (Q-", fc)-automatic we mean the interpretation in this section. 

So far we have only considered representations where the leftmost digit is the 
most significant digit. However, sometimes it is simpler to deal with reversed 
representations where the leftmost digit is the least significant digit. In other 
words, sometimes it is easier to deal with the reversed word and reversed 
language instead of w and L, respectively. Since the regular languages are 
(effectively) closed under reversal, for most of our results it does not matter 
which representation we choose, and we omit extended discussion of this point. 

We use the following notation for intervals: I[a, /3] denotes the closed interval 
{x : a < X < P}, and similarly for open- and half-open intervals. 

4. Examples 

To build intuition, we give some examples of fc-automatic sets of rationals. 
Example 5. Consider the regular language 

Lo^{we (S^)* : miw) e Ck U {0} and [^aMlfc = !}• 

Then quOj,(Lo) = N, as Lq contains words with arbitrary numerator, but de- 
nominator equal to 1. 

Example 6. Let k ~ 2, and consider the regular language Li defined by the 
regular expression A*{[0, 1], [1, 1]} A*, where A = {[0, 0], [0, 1], [1, 0], [1, 1]}. This 
regular expression specifies all pairs of integers where the second component 
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has at least one nonzero digit — the point being to avoid division by 0. Then 
qu0j,(Li) = Q-*', the set of ah non- negative rational numbers. 

Example 7. Consider the regular language 

L2 = {we{El)* : 7r2(u;) e 0*1+0*}. 

Then we claim that quOj,(L2) = Q^^- To see this, consider an arbitrary non- 
negative rational number p/q with q > 1. Let i be the least nonnegative integer 
such that gcd(fc*,(7) ~ gcd(fc'+^,(7). Let d = gcd(fc*,(?) and write q — dq'; note 
that gcd(fc, g') — 1 and hence gcd(fc, (fc — ^)q') ~ 1- By the Fermat-Euler the- 
orem, there is an an integer j > 1 such that fc^ = 1 (mod (fc — l)?')- (Fo'" 
example, we can take j = ip{{k — !)(/'), where ip is Euler's totient function.) 
Define d' = Id and t = .f^} , : then 

p d'tp d'tp 

which expresses p/q as a number with denominator of the required form, with 
base-fc representation 1^ 0*. In Theorems [551 and 1551 below we will see that 
L2 achieves the minimum denominator density among all regular languages 
representing Q-". 

Example 8. For a word x and letter a let \x\a denote the number of occurrences 
of a in X. Consider the regular language 

L3 = {w e (S^) : |7ri(w)|i EE (mod 2) and |7r2(w)|i = 1 (mod 2)}. 

ThenitfollowsfromaresultofSchmidf2^thatquo2(L3) = Q^0-{2" : neZ}. 

Example 9. Let fc — 3, and consider the regular language L4 defined by the 
regular expression [0, 1]{[0,0], [2,0]}*. Then qu0j,(L4) is the 3-adic Cantor set, 
the set of all rational numbers in the "middle-thirds" Cantor set with denomi- 
nators a power of 3 [7|. 
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Example 10. Let k = 2, and consider the regular language L5 defined by the 
regular expression [0, 1]{[0, 0], [0, !]}*{[!, 0], [1, 1]}. Then the numerator encodes 
the integer 1, while the denominator encodes all integers that start with 1. 
Hence quOfe(L5) = {i : n>l}. 

Example 11. Let fc = 4, and consider the set S ~ {0, 1, 3, 4, 5, 11, 12, 13, . . .} of 
all non- negative integers that can be represented using only the digits 0, 1, —1 
in base 4. Consider the set Lq — {{p, (7)4 : p, g e 5*}. It is not hard to see that 



Lq is (Q-", 4)-automatic. The main result in [15| can be phrased as follows: 
qu04(Lg) contains every odd integer. In fact, an integer t is in quo4(Lg) if and 
only if the exponent of the largest power of 2 dividing t is even. 

Example 12. Let K^L be arbitrary regular languages over the alphabet E^.. 
Note that qaOf.{K \J L) — quOj.(_ft') U quOj.(L) but the analogous identity in- 
volving intersection need not hold. As an example, consider K — {[2,1]} and 
L = {[4,2]}. Then qyio^iK n L) = 7^ {2} = quOio(i^) n quOio(i)- 

5. Basic results 

In this section we obtain some basic results about automatic sets of rationals. 

Theorem 13. Let r, s he integers > 1. Then S is a k"^ -automatic set of rational 
numbers iff S is k'^ -automatic. 



Proof. Follows easily from the same result for automatic sequences [8|, |9| . □ 

Next we state a useful result from jgll: 

Lemma 14. Let 13 be a non-negative real number and define the languages 

L<0 = {xe(S^)* : quo,(a;) < /?}, 

and analogously for the relations <,=,>,>, ^. 

(a) If j3 is a rational number, then the language L<p (resp., L^p, L^p, L>^, 
Lyfj, L^p) is regular. 
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(b) If L<0 (resp., i</3, L>0, L^^) is regular, then (3 is a rational number. 



Suppose 5 is a set of real numbers, and a is a real number. We introduce 
the following notation: 

S + a := {x + a : X e S} 

S — a := {max(x — a,0) : x € S} 

a — S := {max(a — x,Q) : x G S} 

aS := {ax : x £ S}. 

Theorem 15. The class of k- automatic sets of rational numbers is closed under 
the following operations: 

(i) union; 

(ii) S^S + a for a G Q^°; 
(Hi) S -a foraG Q^°; 

(iv) S ^a-S for a G Q^°; 

(v) S-^aS fora& Q^°; 

(vi) S^{l/x : X gS\{0}}. 

Proof. We prove only item (ii), with the others being similar. We will use the 
reversed representation, with the least significant digit appearing first. Write 
a = p/q. Let M be a DFA with quoj.(L(M)) = S. To accept S + a, on input 
a base-A; representation of x = p'/q', we transduce the numerator to p'q — pq' 
and the denominator to qq' (hence effectively computing a representation for 
X — a), and simultaneously simulate M on this input, digit- by- digit, accepting 
if M accepts. □ 

The next theorem shows that if p/q has a representation in a regular lan- 
guage, then it has a representation where the numerator and denominator are 
not too large. Here we are using the least-significant-digit first representation. 
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Theorem 16. Let L C (S^)* be a regular language, accepted by an NFA M 
with n states. Ifp/q € quo^{L), there exists w G L with 7ri(w) = p', 7r2(w) = q', 
p'/q' = p/q, and p' , q' < A;P«". 

Proof. On input w representing the pair {p' ,q') we compute qp' and pq' simul- 
taneously, digit-by-digit, and test if they are equal. To carry out multiplication 
by q, we need to keep track of carries, which could be as large as g — 1, and 
similarly for p. An NFA M' to do this can be built using triples (z, j, q/), where 
i is a carry in the multiplication by p, j is a carry in the multiplication by q, 
and qi is the state in M arising from processing a prefix of w. Thus M' has 
pqn states. If M' accepts any word, then it accepts a word of length at most 
pqn — 1. From this the inequality follows. □ 

We now state one of our main results. 

Theorem 17. Let /S C N. Then S is {f^,k)- automatic if and only if it is 
(Q-°, k) -automatic. 

The proof requires a number of preliminary results. First, we introduce some 
terminology and notation. Wc say a set 5 C N is ultimately periodic if there 
exist integers no > 0,p > 1 such that n G S n + p G S, provided n > no. 

In particular, every finite set is ultimately periodic. 

Wc let V = {2, 3, 5, . . .} denote the set of prime numbers. Given a posi- 
tive integer n, wc let pd(n) denote the set of its prime divisors. For example, 
pd(60) = {2, 3, 5}. Given a subset DCV,wc let Tr{D) = {n > 1 : pd(n) C D}, 
the set of all integers that can be factored completely using only the primes in 
D. Finally, recall that i^fe(n) denotes the exponent of the largest power of k 
dividing n. 

First, we prove two useful lemmas. If g G N and S* C N, then by gS we mean 
the set {gs : s € S}, and hy g + S we mean the set {5 -f- s : s € S}. 

Lemma 18. Let 5 C N — {0}. Then the following are equivalent: 

(a) There exist an integer n > 0, and n integers gi > I, 1 < i < n, and n 
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ultimately periodic subsets Wi C N, 1 < i < n, such that 



l<i<n 



(b) There exist an integer m > 0, and m integers fi with kffi, 1 < i < m, 
and m ultimately periodic subsets Vi <^N, I < i < m, such that 

D fdk' : J e V,}, (3) 

l<i<m 

and the union is disjoint. 

(c) Define F — {^^^jy : s £ S*}. The set F is finite, and for all f G F, the 
set Uf — {j : f S} is ultimately periodic. 

Proof, (a) =^ (b): For each gi define Xi — Vk{gi) and fi = gi/k^'. Note that 
k!f,. Then 



^ = U 9^{k' ■■ J e W,} 

l<i<n 
l<i<n 
l<i<n 
\<i<n 

where W[ = Xi + Wi. Note that each W/ is ultimately periodic. If any of the 
fi coincide, we take the union of the corresponding and call it Vi. Since the 
union of a finite number of ultimately periodic sets is still ultimately periodic 
Q, we can choose a subset of the indices i so that each fi appears once, 
expressing S as 

U fdk' ■■ J e 

1 < z < m 

for some m < n. The union is now disjoint, for if (say) fik^ = f2k^ for j € Vi 
and j' G V2, then since /cj/i, /2 we must have j — j' and so fi — f2, which is a 
contradiction. 
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(b) =^ (c): 

Let s £ S. Then s = fik^ for some j. Since kjlfi, we have s/k^''^^^ = fi. 
Thus F — {fi : 1 < i < m} and hence is finite. By the disjointness of the 
union Q (and the fact that fc//i) we have fi G S j EVi. So C//^ = Vi 

and hence Vi is uhimately periodic. 

(c) (a): 
Let 

5ii 527 • • • 7 5n be distinct elements such that F — {gi, g2, ■ ■ ■ , gn}- Take 

If any of the conditions (a)-(c) above hold, we say that the set S is k-finite. 

Lemma 19. Let D he a finite set of prime numbers, and let S C t:{D). Let 
Si,S2,--- be an infinite sequence of (not necessarily distinct) elements of S . 
Then there is an infinite increasing sequence of indices ii < i2 < ■ ■ ■ such that 

Six I Si^ I ■ • ■ . 

Proof. Case 1: The sequence (si) is bounded. In this case infinitely many of the 
Si are the same, so we can take the indices to correspond to these s^. 

Case 2: The sequence (s^) is unbounded. In this case we prove the result 
by induction on If 1, then we can choose a strictly increasing 

subsequence of the (si); since all are powers of some prime p, this subsequence 
has the desired property. 

Now suppose the result is true for all sets D of cardinality t — 1. We prove 
it for \D\ — t. Since only t distinct primes figure in the factorization of the 
Si, some prime, say p, must appear with unbounded exponent in the (s^). So 
there is some subsequence of (s^), say (i^), with strictly increasing exponents of 
p. Now consider the infinite sequence (u;) given by Ui — ti/p''p^*'\ Each Ui has 
a prime factorization in terms of the primes in Z? — {p}, so by induction there 
is an infinite increasing sequence of indices ii, 12, ■ . ■ such that Ui-^ \ Ui^ \ ■ ■ ■ . 
Then p^p'^'^i^itij | p''p^*'2)itj^ | ...^ which corresponds to an infinite increasing 
sequence of indices of the original sequence (s^). □ 

We now state an essential part of the proof, which is of independent interest. 
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Theorem 20. Let D QV be a finite set of prime numbers, and let S C 7r(_D). 
Then S is k-automatic iff it is k-finite. 

Proof. <;=: If S is fc-finite, then by Lemma [T51 we can write it as the disjoint 
finite union 

l<i<m 

where each Vi is an ultimately periodic set of integers. Each term in the union 
has base-fc representation (/i)fe{0* : t & Vi} and hence is regular. It follows 
that S is fc-automatic. 

Suppose S is fc-automatic. Define F — {s/fc'^'''*^ : s £ S}. Suppose F 
is infinite. Clearly no element of F is divisible by fc. Therefore we can write S 
as the disjoint union {J.^^k^Hi, where Hi :— {f ^ F : k'^f G S}. Then there 
are two possibilities: either (a) the sets Hi are finite for all i > 0, or (b) at least 
one Hi is infinite. 

In case (a), define Ui := maxiJ^ for all i > 0. Then the set {uq, ui, . . .} must 
be infinite, for otherwise F would be finite. Choose an infinite subsequence of 
the Ui consisting of distinct elements, and apply Lemma 1191 Then there is an 
infinite increasing subsequence of indices ii < 12 < • ■ • such that Ui-^ \ Ui^ \ ■ ■ ■ . 
So the sequence (uij)j>i is strictly increasing. 

Now consider the characteristic sequence of S, say (/(n))„>o, taking values 
1 if n £ 5 and otherwise. Consider the subsequences (/,) in the fc- kernel of / 
defined by fj{n) — f{k^^n) for n > 0, j > 1. By our construction, the largest 
n in 7r(Z?) such that kfn and fj{n) — 1 is n — Ui-. Since the Ui. are strictly 
increasing, this shows the (infinitely many) sequences (/j) are pairwise distinct. 
Hence, by Theorem [3l / is not fc-automatic and neither is S. 

In case (b), we have Hi is infinite for some i. As mentioned above, S can 
be written as the disjoint union IJ^^q fc*i7i. Let L be the language of canonical 
base-fc expansions of elements of Hi (so that, in particular, no element of L 
starts with 0). The base-fc representation of elements of fc*i?i end in exactly 
i O's, and no other member of S has this property. Since S is assumed to be 
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fc-automatic, it follows that L is regular. Note that no two elements of Hi have 
a quotient which is divisible by fc, because if they did, the numerator would be 
divisible by k, which is ruled out by the condition. 

Since L is infinite and regular, by the pumping lemma, there must be words 
u,v,w, with V nonempty, such that uv^w e L for all j > 0. Note that for all 
integers j > and c > we have 

[uv'+''w]k = [uv^w]k ■ A:^!"^! + {[v^w]k - [w], ■ fc^l^l). (4) 

Let D — {pi,p2, ■ ■ ■ ,pt}- Since [uv^w]k G 5 C 7r(£)), it follows that there exists 
a double sequence {fr,j)i<r<t;j>i of non-negative integers such that 

[uv^w]k=pi'-' ---pt' (5) 

for all j > 0. From ©, we see that /fcl^l+Jl"! < pf '^ ■■■pt\ and hence 
(assuming pi < p2 < • • • < Pt) we get 

(|-uw| + j>|)logfc < ^ fr.Alogpt- 

\l<r<t J 

Therefore, there are constants < ci and J such that cij < J2i<r<t fr,j for 
J>J- 

For each j > J now consider the indices r such that f,.j > cij/t; there 
must be at least one such index, for otherwise fj.j < cij /t for each r and hence 
Si<r <t /'•■J — "^1.?' ^ contradiction. Now consider t+1 consecutive j's; for 
each j there is an index r with fr,j > cij/i, and since there are only t possible 
indices, there must be an r and two integers I' and I, with < I < I' < t, with 
fr.j+i > ci{j + l)/t and fr.j+i' > ci(j + l')/t. This is true in any block of t + 1 
consecutive j's that are > J. Now there are infinitely many disjoint blocks of 
t+l consecutive j's, and so there must be a single r and a single difference I' — I 
that occurs infinitely often. Put S = I' — I. 

Now use (ID) and take c — S. We get infinitely many j such that 
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where E — [v^ivik ~ [w]k ■ k^^""^ is a constant that is independent of j. Now focus 
attention on the exponent of pr on both sides. On the left it is frj+s, which 
we know to be at least ci(j + S)/t. On the right the exponent of pr dividing 
the first term is frj + 6\v\er (where k — p^^ ■ ■ -Pt*); this is at least cij /t. So p'^ 
divides E, where h > cij/t. But this quantity goes to oo, and E' is a constant. 
So E ^0. But then 

which is impossible, since, as we observed above, two elements of Hi cannot 
have a quotient which is a power of k. This contradiction shows that cannot 
be infinite. 

So now we know that F is finite. Fix some f € F and consider Tf = 
{fc* : fc'/ G S}. Since S is fc-automatic, and the set of base-A: expansions {Tf)k 
essentially is formed by stripping off the bits corresponding to {f)k from the 
front of each element of 5* of which {f)k is a prefix, and replacing it with "1", 
this is just a left quotient followed by concatenation, and hence {Tf)k is regular. 
Let M' be a DFA for {Tf)k, and consider an input of 1 followed by I O's for 
^ = 0, 1, ... in M'. Evidently we eventually get into a cycle, so this says that 
Uf — {i : k^ f G S} is ultimately periodic. Hence S is fc-finite. 

This completes the proof of Theorem [20l □ 

We can now prove Theorem II 71 

Proof. One direction is easy, since if S is (N, fc)-automatic, then there is an 
automaton accepting {S)k- We can now easily modify this automaton to accept 
all words over (S^)* whose 7r2 projection represents the integer 1 and whose tti 
projection is an element of {S)k- Hence S is (Q-*', fc)-automatic. 

Now assume S* C N is (Q-", fc)-automatic. If S is finite, then the result is 
clear, so assume S is infinite. Let i be a regular language with q\iOf^{L) = S. 
Without loss of generality we may assume every representation in L is canonical; 
there are no leading [0, 0]'s. Furthermore, by first intersecting with L^q we may 
assume that L contains no representations of the integer 0. Finally, we can 
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also assume, without loss of generality, that no representation contains trailing 
occurrences of [0, 0], for removing all trailing [0, 0]'s from all words in L preserves 
regularity, and it does not change the set of numbers represented, as it has the 
effect of dividing the numerator and denominator by the same power of k. Since 
the words in L represent integers only, the denominator of every representation 
must divide the numerator, and hence if the denominator is divisible by k, the 
numerator must also be so divisible. Hence removing trailing zeroes also ensures 
that no denominator is divisible by k. Let M be a DFA of n states accepting L. 

We first show that the set ['K2{L)]k of possible denominators represented by L 
is finite. Write S ^ Si^J S2, where Si = S n/[0, fc"+i) and 82 = 8 n/[A:"+i, 00). 
Let Li = L O L^i^n+i, the representations of all numbers < fc"+^ in L, and 
L2 = L n i>j,n+i. Both Li and L2 are regular, by Theorem [T4l It now suffices 
to show that 5*2 is (N, fc)-automatic. 

Consider any t G 82- Let z G L2 be a representation of t. Since t > k^~^^, 
clearly \z\ > n, and so t^2{z) must begin with at least n O's. Then, by the 
pumping lemma, we can write z = uvw with \uv\ < n and |f | > 1 such that 
uv^w G L for all i > 0. However, by the previous remark about 712(2), we see 
that 7r2(w) — 0^ for 1 < j < n. Hence [7r2(2;)]fe = [T^2{uvw)]k = [Tr2{uw)]k- 
Since uw must also represent a member of 8, it must be an integer, and hence 
['^2{z)]k I [TTi{uw)]k as weU as [Tr2{z)]k \ [Triiuvw)]k. Hence 

['^2{z)]k I [7Tl{uVw)]k - [TTl{uw)]k {[TTl{uv)]k - [7ri(u)]fc) • fc'""'. 

The previous reasoning applies to any 2 G i2- Furthermore, < [TTi{uv)]k ~ 
[7ri(M)]fe < /c". It follows that every possible denominator d of elements in L2 
can be expressed as d = di • ^2, where 1 < di < k" and d2 \ k"^ for some m. It 
follows that the set of primes dividing all denominators d is finite, and we can 
therefore apply Theorem!^ Since k divides no denominator, the set of possible 
denominators is finite. 

We can therefore decompose L2 into a finite disjoint union corresponding to 
each possible denominator d. Next, we use a finite-state transducer to divide the 
numerator and denominator of the corresponding representations by d. For each 
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d, this gives a new regular language Ad where the denominator is 1. Writing 
T := [Jd^d, we have 52 = quOj,(r) = [J ^ quo Ad). Now we project, throwing 
away the second coordinate of elements of T; the result is regular and hence S 
is a fc-automatic set of integers. □ 

Corollary 21. Let L C (S^.)* be a regular language of words with no leading or 
trailing [0,0] 's, and suppose qu.Oj^{L) C N. Then L can (effectively) be expressed 
as the finite union 

y {{mai,m)k : m e 5,} U (J {{bjn,bj)k : n^Tj}, 

l<i<A i<j<B 

where A, B > are integers, and Si, S2, ■ ■ ■ , Sa and Ti,T2, ■ ■ - Tb are (N, fc)- 
automatic sets of integers, and ai,a2, ■ ■ ■ ,aA and bi,b2, ■ ■ ■ ,bB are non- negative 
integers. 

Proof. Following the proof of Theorem [iTl we see that L can be written as the 
union of Li, the representations of integers < k"~^^ and L2, the representations 
of those > For each integer a < we can consider the words of 

L whose quotient gives a; this provides a partition of Li into regular subsets 
corresponding to each a, and gives the first term of the union. For L2, the 
proof of Theorem [T7] shows that there are only a finite number of possible 
denominators, and the set of corresponding numerators are fc-automatic. □ 

As corollaries, we get that the fc-automatic sets of rationals are (in contrast 
with sets of integers) not necessarily closed under the operations of intersection 
and complement. 

Theorem 22. Let Si = {(fc" - l)/(fc™ - 1) : 1 < m < n} and S2 = N. Then 
Si and S2 are both k- automatic sets of rationals, but Si H ^2 is not. 

Proof. We can write every element of Si as p/q, where p — (fc" — — 1) 

and q = (fc™ — l)/(fc — 1). The base-fc representation of p is 1" and the base-fc 
representation of q is 1™. Thus a representation for Si is given by the regular 
expression [1, 0] + [l, 1]+. We know that N is fc-automatic from Example [3 
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From a classical result wc know that {k™ — 1) | (fc" — 1) if and only if m | n. 
It follows that Si n 5*2 = T, where 

T = {(fc" - l)/(fc™ - 1) : 1< 77i< n and TO I n}. 

If the fc-automatic sets of rationals were closed under intersection, then T would 
be (N, fc)-automatic. Writing n — md, we have 
fc" — 1 

^ fc(d-l)m + . . . _^ fcm _^ 

fc™ — 1 

whose base-fc representation is (10""^)'*"^. Hence {T)k = {(10'""^)''"^ : 
TO > > 1}. Assume this is regular. Intersecting with the regular language 
10*10*10*1 we get {10"10"10"1 : n > 1}. But a routine argument using the 
pumping lemma shows this is not even context-free, a contradiction. □ 

From this result we can obtain several corollaries of interest. 

Corollary 23. The class of {Q-'^ , k)- automatic sets is not closed under the 
operations of intersection or complement. 

Proof. We have just shown that this class is not closed under intersection. But 
since it is closed under union, if it were closed under complement, too, it would 
be closed under intersection, a contradiction. □ 

Corollary 24. Define a "normalization operation" N that maps a word w to 
its canonical expansion in lowest terms {p/d, q/d)k, where d — gcd{p,q) and 
p — TTi{w), q — n2{w), and define N{L) = {N{w) : w £ L}. Then the 
operation N does not, in general, preserve regularity. 

Proof. We give an example of a regular language L where N{L) is not even 
context-free. It suffices to take L = {Si)k, where Si is the set defined above in 
the statement of Theorem[211 Consider N{L)-, then we know from the argument 
above that Si n N = T, where 

T = {(fc" - l)/(fc" - 1) : 1< TO < n and TO I n}. 

Hence N{L) n {N)k = {T)k, but from the argument above we know that 
{T)k = {{lQ"'-^)'^-^l : TO > 1, d > 1} is not context-free. □ 
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Corollary 25. There is a k-automatic set of rationals whose hase-k expansions 
(as real numbers) are not accepted by any Buchi automaton. 

Proof. Define ^3 — {(fc™ — ~ 1) • 1 < m < n}; this is easily seen to be a 

fc-automatic set of rationals. However, the set of its base-fc expansions is of the 
form 

U 0.(0"-™ (fc-1)")"', 

0<m<Ti<oo 

where by x'^ we mean the infinite word xxx • • ■ . A simple argument using the 
pumping lemma shows that no Biichi automaton can accept this language. □ 

It follows that the class of /c-automatic sets of rational numbers we study in 



this paper is not the same as that studied in 

Remark 26. The technique above also allows us to prove that the languages 
Ld = {{p,q)k ■■ q\ p}, Lr ^ {{p,q)k ■■ gcd{p,q) > 1}, and Lg = {{p,q)k ■ 
gcd{p,q) — 1} are not context-free. 

For the first, suppose Ld is context-free and is accepted by a PDA Md. 
Consider a PDA M that on input a unary word x :— V\ n > 2, guesses a word 
of the form y Q"-"!" with 1 < a < n and simulates Md on x xy, accepting if 
and only if a; x y G Ld- Then M accepts the unary language {!" : n composite} 
which is well-known to be non-context-free [l^ Ex. 6.1, p. 141], a contradiction. 

Notice that Ld can be considered as the set of all possible rational represen- 
tations of N. 

For Lr, the same kind of construction works. 

For Lg, a more complicated argument is needed. First, we prove that the 
language Lc — {0*1-' : i,j>l and gcd{i,j) > 1} is not context-free. To see 
this, assume it is, use the pumping lemma, let n be the constant, and let p be 
a prime > n. Choose z = O^' l^' G Lc- Then we can write z — uvwxy where 
\vwx\ < n and |tia;| > 1 and uv^wx'^y £ L^ for all z > 0. Then v and x each 
contain only one type of letter, because otherwise uv^wx^y has I's before O's, a 
contradiction. There are three possibilities: 

(a) vx = 0'^, 1 < r < n; 
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(b) vx ^ 1", 1 < s < n; 

(c) V = 0^ and x — 0'^ , I < r, s < n. 

In case (a) we pump with i = 2, obtaining uv'^x'uP'y — 0^ +'^\p _ But p is a 
prime, so for this word to be in Lc we must have r = (mod p), contradicting 
the inequality 1 < r < n < p. 

Similarly, in case (b) we get 0'' 1^+''. But l<s<p, sop<p + s< 2p. 
Hence p does not divide p + s, so gcd(p^,p + s) = 1, a contradiction. 

Finally, in case (c) and pumping with i = j + 1 we get {)P^+'^i\P+'^i , Since 
p is a prime, and 1 < r, s < p, by Dirichlet's theorem there are infinitely many 
primes of the form p^ + rj and p + sj. If r > s, choose j so that p^ + rj is a 
prime. Since p'^ + rj > p + sj, we have gcd(p^ + rj, p + sj) = 1, a contradiction. 

If r < s, choose j > p^ — p such that p + sj is prime. Then p + j > p^, so by 
adding (s — l)j to both sides we get p + sj > p^ + [s — l)j > p^ + rj. Thus p + sj 
is a prime greater than p^ + rj and so gcd(p^ + rj,p + sj) = 1, a contradiction. 
This completes the proof. 

From this we also get that 

Le = {On^ : gcd(i,j) = l} 

is not context free, since it is known that the class of context-free languages 
that are subsets of 0*1* are closed under relative complement with 0*1* |12| . 

We can now prove that Lg = {{p,q)k ■ gcd(p, g) = 1} is not context-free. 
Suppose it is. Then, since the CFL's are closed under intersection with a regular 
language, L' := Lg f] ([1, 0]* [1, 1]+ U [0, 1]*[1, 1]+) is also context-free. But 
the numerators are numbers of the form (fc" — l)/(fc — 1) and the denominators 
are numbers of the form (fc™ — l)/(fc — 1). From a classical result, we know that 
gcd(fc™ - 1, fc" - 1) = fcg<=d(m,«) _ I gQ follows that 

L' = {[1,0]"[1,1]* : gcd(s-|-t,t) = 1 and s > 0,^ > 1} 
U {[0,1]^[1,1]* : gcd(s-|-i,t) = 1 and s > 0,t > 1}. 
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Now apply the morphism h that maps [a, b] to a + 6 — 1. Since the CFL's are 
closed under morphism, we get 

h{L') = {0™1" : gcd(TO,n) = 1 and TO,n > 1}. 

But we already proved this is not context-free, a contradiction. Hence Lg is not 
context-free. 

6. Solvability results 

In this section we show that a number of problems involving fc-automatic 
sets of integers and rational numbers are recursively solvable. 

Theorem 27. The following problems are recursively solvable: given a DFA 
M , a rational number a, and a relation <\ chosen from 7^, <, <, >, >, does 
there exist x G quOj,(L(M)) with x <i a? 

Proof. The following gives a procedure for deciding if a: <] a. First, we create a 
DFA M' accepting the language L<|q as described in Theorem [T4l above. Next, 
using the usual direct product construction, we create a DFA M" accepting 
L{M) n L^a- Then, using breadth-first or depth-first search, we check to see 
whether there exists a path from the initial state of M" to some final state of 
M". □ 

Lemma 28. Let M be a DFA with input alphabet and let F C Q-^ be a 

finite set of non-negative rational numbers. Then the following problems are 
recursively solvable: 

1. Is F Q quOfe(L(Af))? 

2. Is qi\o^{L{M)) C F? 

Proof. To decide if F C quOj,(L(Af)), we simply check, using Theorem [T^ 
whether x £ qu0;.(i(M)) for each x G F. 

To decide if q\xOf.{L{M)) C F, we create DFA's accepting L=x for each 
x £ F, using Theorem [T4l Now we create a DFA accepting all representations 
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of all elements of F using the usual "direct product" construction for the union 
of regular languages. Since F is finite, the resulting DFA A is finite. Now, using 
the usual direct product construction, we create a DFA accepting L{A) — L{M) 
and check to see if its language is nonempty. □ 

Theorem 29. The following problem is recursively solvable: given a DFA M , 
and an integer k, is qu0;.(i(Af)) of infinite cardinality? 

Note that this is not the same as asking whether L{M) itself is of infinite 
cardinality, since a number may have infinitely many representations. 
First, we need a useful lemma from j^Jl. 

Lemma 30. Let u,v,w G (S^.)* such that \v\ > 1, and such that [Ki{uvw)]k 
and [Tr2{uvu))]k are not both 0. Define 



U := < 



quOkiw), if [Ki{uv)]k = [vr2(-uw)]fe = 0; 

oo, if [Ki{uv)]k > and [TT2iuv)]k — 0; (6) 

otherwise. 

|7r2(«t>)Jfc-|7r2(«)]fc ' 



(a) Then exactly one of the following cases occurs: 

(i) qvLOk{uw) < quOi^{uvw) < quOi,(uu^u)) <••■<[/; 
(a) quOj.(uw) — qaOk{uvw) = quo^iuv'^w) = ■ • ■ = U ; 
(Hi) quOkiuw) > quOj,(uww) > quOj,(ui'^w) >•••>[/. 

(b) Furthermore, limi_i.oo quo^,(ui;*w) = U. 

Now we can prove Theorem [29l 

Proof. Without loss of generality, we may assume that the representations in 
M are canonical (contain no leading [0,0]'s). Define 

[Tri{uv)]k - [TTliu)]k 



['K2{uv)\k - [7r2(u)]fc ' 
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and let pref (L) denote the language of all prefixes of all words of L. Let n be 
the number of states in AI. We claim that quo^, (L(i\f)) is of finite cardinality 
if and only if quOj,(L(Af)) C T, where 

T = {quOj.(a;) : x G L{M) and < n} U 

{7fe(M, w) : uv G pref(L) and > 1 and \uv\ < n}. (7) 

One direction is easy, since if quOj,(L(M)) C T, then clearly quOj,(L(M)) is 
of finite cardinality, since T is. 

Now suppose quOj,(L(M)) C T, so there exists some x € L{M) with qnOf,{x) ^ 
T . Since T contains all words of L{M) of length < rt, such an x is of length > n. 
So the pumping lemma applies, and there exists a decomposition x — uvw with 
< n and |f | > 1 such that uv''w e L for alH > 0. Now apply Lemma [501 
If case (b) of that lemma applies, then quOj,(x) — 7fe(u,w) G T, a contradic- 
tion. Hence either case (a) or case (c) must apply, and the lemma shows that 
quOj,(uu*i/7) for i>Q gives infinitely many distinct elements of quOj,(L(Af)). 

To solve the decision problem, we can now simply enumerate the elements 
of T and use Lemma [28l □ 

Theorem 31. Given p/q G Q-*', and a DFA M accepting a k-automatic set of 
rationals S, it is decidable if p/q is an accumulation point of S . 

Proof. The number a is an accumulation point of a set of real numbers S if and 
only if at least one of the following two conditions holds: 

(i) a = sup( S* n /(— cxD,a)); 

(ii) a = inf ( S* fl /(a, cxd)). 

By Theorem [Ml we can compute a DFA accepting S" 5 fl /(— oo,a) (resp.. 



S n /(a, oo)). By 2l|, Thm. 2] we can compute supS" (resp., inf 5'). □ 



Theorem 32. Suppose S is a k-automatic set of integers accepted by a finite 
automaton M . There is an algorithm to decide, given M , whether there exists 
a finite set D ^ V such that S C n(D). Furthermore, if such a D exists, we 
can also determine the sets F and Uf in Theorem \2(A 
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Proof. To determine if such a D exists, it suffices to remove all trailing zeroes 
from words in {S)k and see if the resulting language is finite. If it is, we know 
F, and then it is a simple matter to compute the Uf. □ 

Theorem 33. There is an algorithm that, given a DFA Mi accepting Li C 
(S^.)*, will determine i/quOj,(Li) C N. If so, the algorithm produces an au- 
tomaton M2 such that [L{M2)\k — quOj,(Li). 

Proof. We start with the algorithm to determine if quOj,(Li) C N. 

Modify Ml, if necessary, to accept only representations of nonzero numbers, 
and to accept the remaining words of Li stripped of leading and trailing [0, 0]'s. 
Let Ml have n states. 

1. Create, using Theorem [HI a DFA A/3 accepting a representation of the 
set Ti (quofc(ii) n /(O, fc"+i)) \ {0, 1, . . . , - 1}. If Ti ^ 0, then answer 
"no" and stop. 

2. Next, create a DFA M4 accepting the set T2 := (quOfc(i) n 00)). 
If any denominator ends in 0, answer "no" and stop. 

3. Compute TT2{L{Adi)) and, using Theorem [5^ decide if the integers so 
represented are factorable into a finite set of primes. If not, answer "no" and 
stop. 

4. Otherwise, compute the decomposition in Corollary [5T1 obtaining the 
finite set of denominators in that decomposition. Check whether each denomi- 
nator divides all of the corresponding numerators. If not, answer "no" and stop. 
Otherwise, answer "yes". 

To see that this works, note that if some non-integer a belongs to quOj.(Li), 
then either a < fc"+^ or a > k"^-^. In the former case we have Ti ^ 0, so this 
will be detected in step 1. 

Otherwise a > fc"+^. Then either the set of denominators of L2 do not 
factor into a finite set of primes (which will be detected in step 3), or they do. 
In the latter case the set of denominators is fc- finite, by Theorem [20] and so has 
a representation in the form given by Lemma [TBI (b). Now k cannot divide both 
a numerator and denominator, because have removed trailing [0, 0]s from every 
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representation. So k divides a numerator but not a denominator if and only 
this is detected in step 2. If steps 2 and 3 succeed, then, there are only finitely 
many denominators. 

Now quOj.(i) C N if and only if the numerators n corresponding to each of 
these finitely many denominators d are actually divisible by d. We can (effec- 
tively) form a partition of L2 according to each denominator. Using a machine 
to test divisibility by d, we can then intersect with each corresponding machine 
in the partition to ensure each numerator is indeed divisible. If so, we can easily 
produce an (N, fc)-automaton accepting quOj.(i). □ 

Corollary 34. There is an algorithm that, given a DFA Mi accepting Li C 
(E^)* and a DFA M2 accepting L2 C (Efe)*, will decide 

(a) ifq\iOf.{L) C [ialfc/ 

(b) ifqno^{L) = [Lalfc. 

Proof, (a) The algorithm is as follows: using the algorithm in the proof of 
Theorem [33l first determine if quOj.(Li) C N. If not, answer "no". If so, 
using the algorithm in that proof, we determine an automaton M such that 
qu0j,(Li) = [L(M)]fc. Finally, using the usual cross-product construction, we 
create an automaton M' that accepts L{M) \ ^2- If M' accepts anything, then 
answer "no" ; otherwise answer "yes" . 

(b) Similar to the previous case. In the last step, we create an automaton 
M' that accepts the symmetric difference (L(M) \ L2) \J (1^2 \ L{M)) . □ 

7. Denominator density 

The density of a language L C E* is the function f^-.N^N defined by 
fL{n) = |E" n L\, the number of distinct words of length n in L. 

The following is a natural question; suppose L is a language such that 
quO;.(L) = What is the smallest possible growth rate of the denomi- 

nators 7r2(i)? If no further restrictions on L are given, then it is easy to see 
that /l can grow arbitrarily slowly (say, by enumerating the rational numbers 
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and then finding arbitrarily large representations for each one). However, if L 
is restricted to be regular, then the best we can do is quadratic, as the following 
two results show. 

Theorem 35. If L is a regular language such that quO;,(i) = Q-'', then 7r2(L) 
(resp., Tri{L)) is not of density o(n^) . 

Proof. We prove the result for tt2', an analogous proof works for tti. 

Suppose there exists such a regular language L. Then by a theorem of Szilard 



et al. 



22| . we know that the density of tt2{L) must be 0{n). By another theorem 



in that same paper, we know that this implies that we can write tt2{L) as the 
finite union 

l<i<n 

where the Ui^Vi,Wi,Xi,yi are (possibly empty) finite words. 

Suppose every n2{uiVi), for 1 < i < n, contains a nonzero symbol. Then for 
every word z € L we would have quOj,(z) < fc*^, where M — maxi<i<„ 
and hence we could not represent arbitrarily large rational numbers, a contra- 
diction. It follows that there must be some nonempty subset S of the indices 
{1, 2, . . . , n} such that UiVi e 0* for all i G 5. 

Similarly, suppose every n2{xiyi), for i e 5, contains a nonzero symbol. Then 
for every word z represented by [Ji^sUiV*WiX*yi we would have h'k{T^2{z)) < 
N, where N — maxi<i<„ |a;ij/i|. But then we could not represent all rational 
numbers of the form (kp + l)/{qk^), where j > N and kp + 1 > k^^^^q, a 
contradiction. It follows that there must be some nonempty subset S' of S such 
that both UiVi and Xiyi are in 0* for all i G 5'. 

Now all rational numbers of the form (kp+l) / (qk^) with j > N and kp+1 > 
k^'i+iq must be representable by the words in IJiGS' '^i'^i'^i^lVi- Since UiVi and 
Xiyi are in 0* for each such term, it follows that the set T of all prime factors 
of denominators of all these words is finite (and consists of the prime factors of 
k and ■n2{wi) for i G S'). Choose any prime r ^ T and consider the rational 
number r'/{rk^), where r' is any prime with r' > rk^'^^^ . Then this number 
has no representation, a contradiction. □ 
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Theorem 36. For each k > 2, there exists a regular language L such that 
quo,(L) = Q^o, and f^,(L){n) = 0{n^). 

Proof. Such an example was given above as Example [71 We claim that there 
are exactly n{n + l)/2 words of length n in 7r2(L). To see this, note that the 
denominators are of the form 0*1^0*, and count the number of words of length n. 
There is 1 word consisting of all I's, 2 words with n — 1 consecutive I's, 3 words 
with n — 2 consecutive I's, and so forth, for a total of 1 + 2 + - ■ ■ + n = n(n + l)/2 
words. □ 



8. Open problems 

There are a number of open problems raised by this work. The most out- 
standing one is a generalization of Cobham's theorem Q] to the setting of k- 
automatic sets of rationals. 

Let r > 1 be an integer. We say a set A C is linear if there exist vectors 
Vq,Vi, . . . ,Vi such that 

A = {vQ+alVl^ V OiVi : ai, 02, . . . , G N}. 

We say a set is semilinear if it is the finite union of linear sets. 

Given a subset A C N'^, we can define its set of quotients q{A) to be {p/q : 
\p,q] e A}. 

Conjecture 37. 5 is a set of rational numbers that is simultaneously k- and 
Z-automatic for multiplicatively independent integers k,l > 2 if and only if there 
exists a semilinear set ^4 C such that S = q{A). 

One direction of this conjecture is clear, as given A we can easily build an 
automaton to accept the base-fc representation of the set of rational q{A). The 
converse, however, is not so clear. 

We now turn to some decision problems whose status we have not been able 
to resolve. Which of the following problems, if any, are recursively solvable? 

Given a DFA accepting L C representing a fc-automatic set of rationals 

S, 
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1. does S contain at least one integer? (alternatively: is there {p,q)k € L 
such that 9 I p?) 

2. does S contain infinitely many integers? 

3. are there infinitely many (p, q)k G L such that q \ p7 

4. is there some rational number p/q G S having infinitely many distinct 
representations in L7 

5. are there infinitely many distinct rational numbers p/q E S having in- 
finitely many distinct representations in L7 

6. do all rational numbers p/q G S have infinitely many distinct representa- 
tions in L7 

7. do all the {p, q)k G L have gcd(p, q) = 1? 

8. does any (p, q)k G L have gcd(p, q) = 17 

9. is max(p^-)j^_g^ gcd(p, g) bounded? 

In the particular case where S = Q-'^, we have the following conjecture. 

Conjecture 38. If L is a regular language with quOj,(L) = Q-''\ then L con- 
tains infinitely many distinct representations for infinitely many distinct rational 
numbers. 

We note that it is possible to construct a regular L with quOj,(_L) = Q^", 
yet there are infinitely many rational numbers with exactly one representation. 
For example, if = 2, we can take the language L3 of Example [S] and add back 
a single representation for each element of the set {2" : n G Z}. 

Here are some additional decision problems whose status (recursively solv- 
able or unsolvable) we have not been able to resolve: 

Given DFA's Mi and M2 accepting languages Li,L2 C representing sets 

1. is Si = S27 

2. is S1CS27 

3. is Si n 82^ 0? 
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We hope to address some of these questions in a future paper. 

Narad Ranipersad observes (personal communication) that arbitrary ques- 
tions involving divisibility and automatic sequences are recursively unsolvable. 
We are indebted to him for allowing us to quote his argument here: consider 
adjoining the divisibility relation | to the first-order theory of (N, +, V2), where 
Vfe is the function mapping its integer argument x to the greatest power of r 
dividing x. Using the divisibility relation one can define the unary predicate 
P3 consisting of the powers of 3: a number a; is a power of 3 if and only if for 
all integers y > 1 dividing x, the number y is itself divisible by 3. However, 
by a result of Bes we know that the first order theory of (N, +,V2,/3) is 
undecidable. 

As an example of the power of the divisibility relation added to our theory 
of automata, consider the following examples. 

Example 39. Consider the language iy C (S^)* defined by words with first 
component representing a numerator n of the form 2'' + 1 for i > 2 and denomi- 
nator an odd number d with I < d < n. This is regular, as it is specified by the 
regular expression [1, 0]X*[0, 1]X*[1, 1], where X = {[0, 1], [0, 0]}. Now consider 
the language 

L'e ■= {{p)2 ■■ P such that (^,9)2 G L7}; 

this language represents the base-2 expansions of all Fermat primes. An al- 
gorithm for determining the cardinality of such expressions would be able to 
resolve the question of the existence of a Fermat prime greater than 2'^^ + 1. 

Example 40. Consider the language ig C (S2)* defined by words with first 
component representing a numerator n of the form 2* — 1 for i > 3 and denom- 
inator an odd number d with 1 < d < n. This is easily seen to be regular. Now 
consider the language 

^7 {(P)2 ■ P such that {p,q)2 e Ls}; 
this language represents the base-2 expansions of all Mersenne primes. An 
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algorithm to determine if such an expression is of infinite cardinaUty would 
resolve the question of the infinitude of the Mersenne primes. 
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